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PEPTIDE-MEDIATED GENE TRANSFER 



BACKGROUND OF THE INVENTION/ 
The present invention is directed to a methodology for 
highly efficient stable integration of DNA into a fcukaryotic 
genome. More sp-cifically. the present invention 7 is directed 
to the use of a synthetic polypeptide, containing a nuclear 
localization signal, to complex with a DNA rodecule and to 
facilitate its transportation and integration into the nuclear 
geoome of a mammalian or other eukaWotic celL for 
example, in the context of producing ceUV lines with an 
extended life. 

DNA-CaP0 4 co-precipitation was the ft/st method devel- 
oped to introduce genes into rnammaiian/cells. (''Gene" in 
this regard denotes a structural DNA sequent. Lc. a DNA 
that codes for a polypeptide, and comprehends oncogenes as 
well as DNAs coding for a known expression product.) The 
co-precipitation method was applicable only to certain cell 
types, however, and could not be us/d to introduce genes 
Lnto a wide variety of cell lines, especially those of hemato- 
poietic origin. Moreover, the stable feene transfer efficiency 
was rather low. on the order of lCT* £ 10**. McNally. M. A., 
et. ai. BioTcchniqucs 6: 8826 (1 
loc. cit. 6:413 (1988). 

Limits on introducing and e 
mammalian cells motivated a s 
approaches to gene transfer. M 
example, thai utilized chcrai 
tively charged and. hence, ab' 



S8); Yen. T. S. B.. et ai. 

fressing genes in cultured 
h for other, more efficient 
tods were developed, for 
agents which were posi- 
to complex with negatively 



charged DNA molecules. Exiraples of such agents include 
DEAE dcxtran and variousycalionic lipid molecules. Cells 
treated with DNA complexes comprised of such an agent 
can lead to the introduction of the DNA into different 
mammalian cell lines. Mainino. R- J. ct. al.. BioTechniques 
6: 682 (1988); Feigner. P/et al.. Proc. Hat'l Acad. Set. USA 
84* 74 13 ( 1987); Fraley. R. et. al. Trend Biochcnu Set. 6: 77 
(1981); Holier. W. et. a/. Ejp. Cell Res. 184: 546 (1989); 
McCutchan.J. H.etal.iT. Noi l Cancer Inst. 41: 351 (1986); 
Chaney. W. C. et aL. Somatic Cell dc Mol Genet. 12: 237 
(1986). 

The production of/a gene product for only a short time 
period after transfection. usually from 48 to 72 hours, is 
called "transient /expression." Many of . the DNA- 
complexing agent*/ reported heretofore, while useful in 
transferring a gen/ into raarnmalian cells, resulted in only 
transient expression of the introduced gene in a small 
fraction of the transfected cells. See. for example. Miller et. 
al.. Proc. Nafl Acad. ScU USA. 76: 949 (1979); Oi el aL 
loc cit. 80: 825/(1983). 

In addition t6 giving poor results with respect to stable 
gene cxpressido. transfer methods based on such DNA- 
complexing aients often were effective only with estab- 
lished cell links, and did not work very well with primary 
cells isolated/from various mammalian species. Other tech- 
niques therefore were needed to enhance gene transfer 
efficiency, tf increase the variety of cell types capable of 
being transacted, and to effect stable gene transfer. Stable 
gene transfer is the ability of cells to maintain and express 
transfected DNAs in a stable manner, through integration of 
the transfected DNA into cell chromosomes. 



VB^xovital vectors, which were under development^HF 

aC?ut the same time seemed to be quite effective in trans- 
ferring genes into different cell types. The use of such 
vectors was prompted by the elucidation of geoe regulation 
in various murine and avian retroviruses. Two other devel- 
opments led to the development of retrovinis-based gene 
transfer vehicles. The first development was the identifica- 
tion of minimal sequences required for efficient packaging 
of viral particles in a cell lioe which produced the coat 
proteins and other structural components of the viral particle 
in trans. The cell lines that provided the structural compo- 
nents for vims development are called packaging" cell 
lines. The second significant step in the establishment of 
retroviral vectors was the development of both ecotropic and 
amphotropic packaging cell lines, which aided the design of 
recombinant retroviral particles which could infect both 
murine and human cell lines. 

Additional modifications of retroviruses were deemed 
necessary to address concerns that retroviral vectors could 
recorabine in vivo to generate wild-type virus. Develop- 
ments in this regard yielded a number of safe retroviral 
vectors which have been used to transfer genes into a variety 
of established roammalian cell lines, as well as into certain 
primary cells in a few instances. E. Gilboa et aL. BioTech- 
niques 4: 50* (1986); A. D. Miller et al.. MoL CelL BioL 6: 
2895 (1986); H. Stuhlmann et aL. loc. ciL 9: 100 (1989); A. 
D. Miller et aL, BioTechniques 7: 980 (1989); J. A. Zwiebel 
et aL. Science 243: 220 (1989). 

Even though these vectors were effective with respect to 
various mammalian cells, there were many restrictions on a 
wider application of the retroviral gene-transfer technique. 
These limitations included (1) the size of exogenous DNA 
that can be inserted into a retroviral vector and (2) the use 
of only dividing cells for retroviral gene transfer. E. Gilboa. 
BioTechniquts. supra (1986); A. D. Miller, supra ( 1986); H. 
Stuhimann. et al.. MoL CetL BioL supra. (1986); A. D. Miller 
et aL. BioTechniques, jupra (1986); J. A. Zwiebel et al.. 
supra (1989). 

Other viruses have been used to generate recombinant 
viral vectors for gene transfer studies. Adenovirus, adeno- 
associated virus, herpes simplex virus, and even HTV have 
been employed as vectors to introduce genes into both 
established ceU lines and primary cells. Some of these viral 
vectors are capable of transferring genes into non-dividing 
cells R. J Samulski. et al., EMBO J. 10: 3941 (1981); J. D. 
Tratschin. et aL. MoL CeU. BioL 5: 3251 (1985): P. L. 
Hermonat et aL. Proc. Not'l Acad. ScL {USA) 81: 6466 
( 1984); D. J. Fink, et aL. Human Gene Therapy 3:11 (1992). 

Viral vectors capable of transferring genes into non- 
dividing cells usually require the generation of high-titer 
viral stock in order to achieve high efficiency gene transfer 
into different cell types. In addition, whenever a different 
regulatory sequence is to be tested for optimal level of gene 
expression into primary cells, a new viral stock must to be 
made and tittered for every modification. All these involve 
very time-consuming experimental manipulations. 

Still another concern relates to the application of viral 
vectors in human gene therapy. A number of studies have 
been carried out in primates to test the safety of retroviral 
vectors for introducing cells transduced with retroviral vec- 
tors into animals. Some of these animals have developed 
various forms of lymphoma. R. E. Donahue, et al.. J. Exp. 
Med. 176: 1 125 (1992). Additional safety features have been 
introduced into some of the newer versions of retroviral 
vectors, yet arc not available for all types of viral vectors. 



SUMMARY OF THE INVENTION 

It therefore is an object of the present invention to provide 
a method for high efficiency gene transfer to achieve 
expression, stable as well as transient, in a wide spectrum of 
cell types, including primary cells from various mammalian 
species. 

It is also ao object of the present invention to provide cell 
lines which, even if derived from primary mammalian cells, 
arc chararteri2cd by an extended life in culture:; 

It is another object of the present invention to provide a 
readily implemented screening system for identifying 
sequences that influence in the expression of cloned genes in 
various primary cell types from different species. 

In accomplishing these and other objectives, there has 
been provided, in accordance with one aspect of the present 
invention, a transfection vector comprising a synthetic 
polypeptide linked electrostatically to a DNA structural 
sequence, forming a polypeptide- DNA complex, where the 
polypeptide is comprised of (A) a polymeric chain of basic 
amino add residues. (B) an NLS peptide and (C) a hinge 
region of neutral amino acids that connects the polymeric 
chain and the NLS peptide. The polymeric chain preferably 
is comprised of between 10 and 50 residues, winch can 
selected from lysine, argininc and ornithine, for example, 
while the hinge region is comprised of between 6 and 50 
amino acid residues selected, for example, from glycine, 
alanine, leucine and isoieucinc. The NLS peptide preferably 
is located at the amino terminus of said polypeptide and the 
polymeric basic amino acid chain at the carboxyl terminus. 
Among exemplary NLS peptides are the SV40 large T 
antigen NLS sequence, the polyoma large T antigen NLS 
sequence, the adenovirus El a NLS sequence, and the aden- 
ovirus Elb NLS sequence. 

In accordance with another aspect of the present invention 
an extended life cell line is provided that is the product of 
o-ansfecting a mammalian cell with a vector as described 
above. The mammalian cell thus transfected can be selected 
for example, from the group consisting of a human umbilical 
vein endothelial cell, a human dermal microvascular endot- 
helial cell, a human peripheral blood rnonocyte/macrophage 
cell, a human aortic smooth muscle cell, and a rabbit liver 
noo -parenchymal cell. 

Other objects, features and advantages of the present 
invention will become apparent from the following detailed 
description. It should be understood, however, that the 
detailed description and specific examples, while indicating 
preferred embodiments of the invention, are given by way of 
illustration only, since various changes and modifications 
within the spirit and scope of the invention will become 
apparent to those skilled in the art from this detailed descrip- 
tion. 
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DETAHJED DESCRIPTION OF PREFERRED 
EMBODIMENTS 

The present invention allows for the highly efficient 
transfer and stable integration of DNA into eukaryotic cells, 
such as cells from established mammalian cell lines, primary 
cells from mammalian tissues, and plant cells. The present 
invention also can be applied to developing cell lines from 
non-dividing cells, such as human peripheral blood mono- 
cytes and macrophages. 

In accordance with the present invention, a synthetic 
polypeptide is provided that can complex with a DNA 
molecule very efficiently by taking advantage of the high 
negative charge density on the polynucleotide. To this end. 
a synthetic polypeptide of the present invention composes a 
DNA-binding sequence that is rich in basic amino acids, 
such as lysine, arginine and ornithine, and that is typically 
ten to fifty residues long. r>isoraers of these basic amino 
acids are suitable so long as the length of the stretch of basic 
amino acids is within the prescribed length. The DNA- 
binding sequence can be a homopolymer of a basic amino 
acid or it can comprise more than one kind of basic residue. 
The DNA binding sequence must be of adequate length to 
bind DNA. yet not so long thai it precipitates out of the 
solutions employed in the present methodology, as discussed 
below. 

A synthetic polypeptide of the present invention also 
contains an amino acid sequence corresponding to a nuclear 
localization signal (NLS) sequence. A representative sample 
from the diverse range of nuclear on signab which 

have been identified are listed in Table I below. (SEQ ID 
NOS:l-54). 
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TABLE 1 -continued 



Rabbit 


Progesterone receptor 


HIV-I 


Rev 
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PDCF A-chain 
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(SEQ ID NO: 27) R— K — F — K — K 
(SEQ ID NO: 28) R— R — N — R — R — R — R — W 
(SEQ ID NO: 29) P— R-3-S — G — K — K — R — K — R — K — R- 
(SEQ ID NO: 30) K— K — K — K — K 
(SEQ ID NO: 3i) P— P— K— K— R . 
(SEQ ID NO. 32) P— K — K — K — K— K 
(SEQ ID NO: 33) S — K— R — V — A — K — R — K — L 
(SEQ ID NO: 34) P—L — L — K— K — I — I — Q 
(SEQ ID NO: 35) P — P — Q — K— K— I — K— S 
(SEQ CD NO: 36) P— Q — P — K— K — K — P 
(SEQ ID NO: 37) F — K — R — K — H — K — K 
(SEQ ID NO: 38) S — K— C — L—G — W— L — W — G 
(SEQ ID NO: 39) G — K — R— K— N— K — P— K 
(SEQ ID NO: 40) K— T — R— K — H— R— G 
(SEQ ID NO: 41) K— H— R — X— H — P — O 
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^— K — R— A—V — R— R 



Protein 



Nuclear Localization Signals 



TOA-IA (tobacco) 
TGA-IB (tobacco) 

02 NLS B (maize) 

NIa fPotyvirus) 

VI- D 2 (A^robactrrium) 

VVE2 NSE1 (Agrobacterium) 

VirE2 NSE2 (Agrobacteriutn) 
02 NLS A (maize) 

R NLS A (maize) 
R NLS M (maize) 
RNLSC (maize) 



(SEQ ID NO: 42) K£S — L~A— 0 — N— R — E — A— A— S — R- 
(SEQ ID NO: 4 3 ) — A — R — L — V — R — N — R — ] 

(SEQ ID NO: 44)»-Q- gg%gB 
(SEQ ID NO: 45) 

(SEQ ID NO: 46)B3-N-~ Q— K— H— K-L— K— MOZAaSHSjgg 
(SEQ ID NO. 47)§§§-p — R — E— 

D — R — Y — I — Q — T — E— {S3— Y 





(SEQ ID NO: *8 ffi^-R— P- 

(SEQ ID NO: *9) K — T — K — ' 
(SEQ ID NO: 50) M — E — E — A — V — T — M— A — P — A — A — V — S — S — A — V — V- 

(SEQ ED NO: 51) M-3-Y — N — A— 1 — L— ^ — E — E — D — L — E 
•A — A — P — A- 



-D — P 



(SEQ ID NO: 52) ( 
(SEQ ID NO: 53) M — S — — K — L 
(SEQ ID NO: 54) M — ( — S — E — A — L— Bs§B — A- 1 -O- 



See Garcia-Bustos ct al.. Biochenv Biophys. Acta 1071: 83 
(1991). Raikhel. N.. Plant Physiol. 100: 1627 (1992). and 
Citovsky. V. et al. Science 256: 1S02 ( 1992). the contents of 
each of which are hereby incorporated by reference. 

In the present invention, an NLS peptide, which typically 
is six to fifteen amino acids in length, facilitates transport of 
the associated DNA into the nucleus. Because the synthetic 
polypeptide promotes the transport of the transfected gene 
into the nucleus of the host celL this method provides both 
highly efficient stable and transient gene expression. Once 
inside the nucleus, the introduced DNA is immediately 
available to the transcription machinery, and can be 
expressed transiently. Simultaneously, the introduced DNA 
is also in the process of getting integrated into the host 
chromosome to give rise to stable expression. Thus, the 
method of the instant invention can achieve both transient 
and stable expression of introduced DNA. 

Transient gene expression results when the method of 
gene transfer results in the introduction of the DNA 
sequences into the nucleus in an non-integrated form. Tran- 
sient transfection is measured 24 to 72 hours after transfec- 
tion by assays that measure gene expression of the trans- 
fected gene(s). In contrast, stable expression of the encoded 
protein results when the transferred DNA sequences axe 
stably integrated into the chromosomal DNA of the target 
cell. Stable transfectaou remain capable of expressing the 
transfected DNA after two weeks or greater following the 
method of the invention. Commonly used assays monitor 
enzyme activities of chloramphenicol acetyl transferase 
(CAT). LAC-Z. P-galactosidasc (P-gal). ^ glucuronidase 



(GUS). lucif erase, or human growth hormone, each of which 
may be contained in the present invention. 

The NLS domain of the synthetic peptide is based on 
known endogenous peptide sequences that were identified 
by reference lo two criteria: (1) sufficient to redirect a 
cytoplasmic protein to the nucleus and (2) necessary for 
directing a nuclear protein to the nucleus. Methods for 
assessing an NLS peptide's ability to direct protein to the 
nucleus are known in the art. See Garcia-Bustos, el al. 
supra. Sandler ct al.. J. Cell BioL 109: 2665 (1989). and 
Citovsky et al. supra, the respective contents of which are 
hereby incorporated by reference* For example, an NLS 
peptide or a natural protein containing an NLS is fused to an 
otherwise non-nuclear protein, by either synthetic or recom- 
binant production. The hybrid protein is then assessed for its 
ability to target the non-nuclear protein to the nucleus. 

The presence of the non-nuclear protein in the nucleus can 
be determined by a functional assay or iniraunoflucrescence. 
An illustrative assay entails the histochemical determination 
of a product produced by the non-nuclear protein, such as a 
colorimetric marker produced by fj-gal or GUS. (A "colo- 
rimctric marker** includes an enzyme that can catalyze a 
reaction with a substrate to elicit a colored product which 
can be detected or measured by a variety of means, such as 
standard fluorescence microscopy, flow cytometry, spectro- 
photometry or colorimetry. •'Immunofluorescence'* relates to 
detecting the presence of the non-nuclear protein in the 
nucleus by means of an antibody specific for the targeted 
protein.) 

In the past NLS peptides have been studied to assess their 
ability to target reporter proteins to the nucleus. Also, 
endogenous proteins containing an NLS. such as the VirD2 
and VirE2 of Agrobacterium. have been shown to mediate 
the transfer of the Agrobacterium single-stranded DNA 
intermediate T-strand to the plant cell nucleus eodogcnously. 
See Citovsky. et al.. supra. There has been no suggestion 
heretofore, however, to use an NLS peptide to target a 
polynucleotide to the nucleus of a eukaryotic cell. 

A preferred NLS domain contains a short stretch of basic 
amino acids like the NLS of the SV40 virus large T antigen 
(PKKKRKV) (SEQ ID NO:3). which is an NLS that has 
been shown to be effective in mammalian cells (basic 
residues are highlighted). Another preferred NLS domain 
consists essentially of short hydrophobic regions that con- 
tain one or more basic amino acids (KIPDC) (SEQ ID NO: 1). 
which is like the NLS of mating type ct2. The NLSs that 
transport DNA into the plant cell nucleus often are bipartite, 
which means that they are usually comprised of a combi- 
nation of two regions of basic amino acids separated by a 
spacer of more than four residues (see stippled segments in 
Table I), such as the Xenopus nucleoplasm^ 
(KRPAATKKAGQAKKKK) (SEQ ID NO:55). 

The NLS peptide of the present invention can be designed 
to accommodate different host cells, both mammalian and 
plant cell hosts. 

The method described here can suitably be modified to 
introduce genes into plant protoplasts using plant NLSs. 
such as those described by Raikhel (1989). supra. 

The present gene transfer system is also capable of 
transferring foreign DNA into gymnosperms and 
angiospenns. Procedures for assessing the introduction of 
foreign DNA in plants are known to the art. such as those 
disclosed by MikL B. L., et al.. in METHODS IN PL^\NT 
MOLECULAR BIOLOGY AND BIOTECHNOLOGY. B. 
R. Glick et al. eds. (CRC Press. 1993). and Gruber. M. Y. 
et al.. id. 



synthetic polypeptide of the present invention 
comprised of a DNA binding domain and an NLS peptide 
domain which are separated by a third element a hinge 
region of neutral amino acid, to minimize stearic interfer- 
ence between the two domains. For this purpose, the hinge 
region ranges in length from about six to twenty-five amino 
acids, and contains a stretch of neutral small amino acids 
without any bulky hydrophobic or ionic side chains. 

The NLS sequence can be located at either the amino 
terminus or the carboy-terminus of the synthetic peptide, 
The arrangement of the two domains, basic amino acid 
sequence and NLS sequence can be interchanged without 
affecting the high gene transfer efficiency. As indicated 
previously, such a synthetic polypeptide binds electrostati- 
cally to the DNA that is to be introduced into the target celL 
The weight ratio of polypeptide to DNA in the resulting 
complex generally is in the range of 1 :1 to 1: 10; for example. 
1 polypeptide to 1 to 10 ug of polynucleotide. 

In accordance with the present invention, entry of the 
DNA-polypeptide complex into cells can be promoted by 
treating target cells with a hypertonic solution, followed by 
hypotonic treatment of cells in the presence of gene- peptide 
complex. See, for example. Obda and Rechsteiner, Cell 29: 
33 (1982). A suitable hypertonic solution can contain both 
polyethylene glycol (PEG) and sucrose, preferably in the 
concentration of 0.3M-0.6M and 10% to 25%. respectively, 
and is referred to as **primer" hereinafter. Olcada et aL. supra. 
and T. Tabu, et al. Biochcm. Biophys. Acta 104S: 105 
(1990). 

The methodology of the present invention has been used 
to develop stable transfectants of different established cell 
lines. It also has been employed to transfer genes into 
primary cells from different mammalian species, thereby to 
obtain cell lines that retain many of the characteri sties of the 
cognate primary cells. Cell lines developed from primary 
cells via the methodology of the present invention are called 
"extended life" cell lines in this description, because the cell 
lines so developed retain almost all of the characteristics of 
their cognate primary cells even in their late passage. The 
range of cell types that can be converted to extended life cell 
lines, according to the present invention, is based on the 
availability of primary cells or the ability to isolate a primary 
cell from the organ in question. In this regard, the inventive 
methodology is not Limited to cell types amenable to trans- 
formation. In addition to the cell types already mentioned, 
the present invention can be applied to pancreatic beta cells, 
human liver and kidney cells, and human hematopoietic 
stem cells, among others. 

The methodology of the present invention has been used 
to develop an extended life cell line from human monocyte/ 
macrophage cells, which are normally non-dividing. In all 
these instances, stable cell lines were obtained with a very 
high efficiency, either comparable to or better than the 
efficiency using retroviral vectors. 

The present invention finds application as well in both ex 
vivo and in vivo gene therapies, where genetic material is 
transferred into specific cells of a patient Ex vivo gene 
therapy entails the removal of the relevant target cells from 
the body, transduction of the cells in vitro, and subsequent 
reintroduction of the modified cells into the patient. 

A gene therapy pursuant to the present invention could 
involve an ex vivo introduction, into a particular cell type 
from the patient, of a pol nucleotide coding for a correcting 
protein which can be produced in functional form by the 
targeted cell type. Genes suitable for expression in this 
regard include an adenosine deaminase gene, a globin gene, 
an LDL receptor gene, and a glucose cerebrosidase gene. 




^^biffereot kinds of gene -therapy applications require < 
stable or transient geoe expression. The method of the 
present invention is advantageous in that it can be used in 
gene therapy requiring either stable gene expression or 
transient gene expression. Transient expression of a foreign 
geoe is preferred when expression of the exogenous product 
is needed only for a short period of time; thereafter, rapid 
clearance of the gene product and its vector is desirable. 
Transient expression is also desirable when the prolonged 
effects of the exogenous protein's expression are unknown. 
Stable expression in gene therapy is needed when the patient 
has a genetic defect that is incompatible with life. Such 
genetic defects include but are not limited to cystic fibrosis. 
Tay Sachs and cancer. Mulligan. Science 260: 926 (1993). 

A gene therapy pursuant to the present invention also 
could involve an in vivo introduction of a structural DNA 
into cells of a patient's body. For stable transfer of genes into 
a target tissue using this method, the ligand to the target 
receptor will be conjugated to the synthetic polypeptide. The 
pdypepude-ligand combination can be complex ed to a 
polynucleotide coding for the needed protein and then 
introduced into the host organism through blood circulation. 
When this complex reaches the target tissue, the whole 
complex will be taken up by cells containing the correspond- 
ing receptor for the ligand through receptor roe dialed pro- 
cess. Because of the NLS io the polypeptide- ligand 
complex, the complex will enter into the nucleus, resulting 
in a stable integration of the introduced gene into the host 
chromosome and. thereby, a correction of the genetic defect 
in the host Cell-specific receptors are well known to those 
of skill in the art. as are their ligands which can be used in 
complexes for receptor-mediated gene transfer. Michael. S. 
L. et ai.. /. Biol. Chem, 268: 6S66 (1993). For example, 
when the liver is the tissue targeted for gene therapy, the 
DNA encoding corrective protein is coraplexed to a syn- 
thetic neoglycoprotein that will target the complex to the 
asialoglycoprotein receptor on hepatocytes. For example, a 
cell type specific receptor such as asialoglycoprotein can be 
chemically linked to the transfection vector at the carboxyl 
terminal of the synthetic polypeptide molecule to deliver the 
foreign gene directly into liver cells. An additional hinge 
region can be incorporated into the molecule before chemi- 
cally linking the polypeptide molecule to a cell-type specific 
ligand molecule, such as asialoglycoprotein or a cell-specific 
monoclonal antibody. 

An example of a carrier useful for receptor-mediated gene 
transfer to liver is a synthetic glycoprotein in which bovine 
serum albumin (BSA) is covalently bound to poly L-lysinc 
using l-ethyl-3-(3-dimethyIaminopropyl) carbodiimidc 
(EDC). Ferkol ct al.. FASEB 7: 1081 (1993). To produce a 
neoglycoprotein conjugate for use in targeting DNA to liver, 
a reaction mixture thai contains about 170 raM galactose. 4 
mM poly (L-lysine). 160 mM BSA and 10 mM EDC (pH 
7.5) can be incubated for 48 hours at 22° C. DNA is 
coraplexed to the neoglycoprotein carrier in a 360:1 molar 
ratio. The carrier- DNA complexes arc dialyzed against 150 
mM sodium chloride before transfection. 

Expression of a functional protein after transfection with 
DNA coraplexed to ligand alone is often transient Ferkol et 
al.. supra. The method of the present invention greatly 
improves the cell-specific targeting of receptor- mediated 
transfection by providing stable expression by increasing 
stable integration of a foreign DNA in the host cell using a 
synthetic polypeptide molecule of the present invention. 
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cKpling a synthetic polypeptide as described above to 
monoclonal antibodies which recognize a cell surface anti- 
gen on the target cells. Maruyama et al.. Prvc. NaflAcad. 
ScL USA 87: 5744 ( 1990). The coupled monoclonal antibody 
and synthetic polypeptide then are complexed with a DNA 
encoding the required or desired protein. This complex will 
target the DNA to the cells expressing the corresponding cell 
surface antigen. Any tissue of the human body can be 
targeted for the gene therapy of the present invention using 
the disclosed methods. A target tissue is suitable in this 
context so long as it is susceptible to genetic modification 
according to the present invention. 

The present invention is further described with reference 
to the following examples, which arc only illustrative and 
not limiting of the invention. 

EXAMPLE 1 

Transfer of Genes into Established Cell Lines for 
Purposes Transient Gene Expression and Selection 
of Stable Transfectants, Respectively 

Forming the DNA-Polypeptide Complex and 
Transecting Cells Therewith 
The DNA or other polynucleotide to be transfected, such 
as a plasmid containing a gene for a drug resistance marker 
or coding a protein needed for expression in the host cell, is 
complexed to a synthetic polypeptide molecule in different 
weight ratios in an isotonic buffer solution. For example the 
weight ratio of DNA: polypeptide can be between 1:1 and 
10:1, although ratios outside of this range may be evaluated 
empirically for achieving the objects of the present inven- 
tion. An isotonic buffer solution such as Hanks buffered salt 
solution or HEPES buffered saline may be used for com- 
pleting DNA to polypeptide. 

While the complex is formed, the cells that are to be 
transfected either remain attached to a substratum, such as a 
tissue culture dish, or are pelleted (for cells that grow in 
suspension). The cells are treated with a hypertonic primer 
solution, such as a concentration of 0.3M-O.6M sucrose and 
10* PEG io either Tris-HCl or HEPES (pH 7.2) buffered 
solution, for 3-5 rains at room temperature. The primer 
solution then is removed- 
After the DNA-polypepude complex is formed, it is made 
hypotonic. The complex solution is hypotonic when it has a 
lesser osmotic pressure than a 0.15M or 0.9^ solution of 
NaCl. For example, the complex in isotonic buffer can be 
made 40-55% hypotonic or 0.075M simply by adding an 
amount of distilled water that is equal to the volume of the 
complex in isotonic buffer. The hypotonic complex solution 
then is added to the cells that have been treated with the 
primer solution. Cells remain in the hypotonic DNA- 
polypeptide solution for 3-4 minutes. Fresh medium then is 
added to the cells to rinse away excess DNA- polypeptide 
solution. Thereafter, the cells are grown normally. 

Producing a Synthetic Polypeptide Molecule 

An example of an synthetic polypeptide molecule of the 
•present invention is one consisting of the amino acid 
Sequence PKKKRKVSGGGGGKKKKKKK 
ID NO:56). Such a peptide can be synthesized, using stan- 
dard methods of peptide production, and purified by stan- 
dard methods using high pressure Liquid chromatography 
(HPLC). 
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Selection of Stably Transfected Cell Lines 

Transfected cells are grown id regular growth medium for 
48 hours, and then plaied in selective medium containing 
400 pg/ml of G418. Cells were plaied at a density of 
100- 1000 cells per 60 cm 2 dish. The number of C418- 
resistaol colonies was determined two weeks after the ini- 
tiation of selection. Other selectable markers, such as pHyg. 
may be used to achieve the results of the instant invention. 
K. Blochlinger. et al.. MoL CelL BioL 4: 2929 (1984). 

This method gave a stable-transfection efficiency of 
5-10%. Similar results were obtained using either G418 or 
hygromycin selection. In general the stable transfection 
efficiency achieved by the method of the instant invention is 
a few orders of magnitude greater than prior art methods. 
The instant invention^- 10% efficiency is several orders of 
magnitude better than the efficiency of the DNA-CaP0 4 
co-precipitaUon method and at least equal or 5 times greater 
than the fairly high 1-10% level of stable transfection 
efficiency achieved by viral based methods. 



TABLE 0 



Method 


Stabk Trxmfcctioo Efficiency 


Peptide -Melted 


5-10% 


Gene Transfer 


1-10% 


Retrovir*! vecton 


Noo-riral methods 


<2% 


(eg, C »P0 4 ) 





That the transfectants of the instant invention are stable is 
shown by the following example. When G418 resistant 
colonies were grown without selection for variable period of 
times, and then tested for resistance to the antibiotic by 
plating the cells under clonal conditions, the same number of 
colonies were obtained both with and without G418. This 
result indicates that, once the cells are selected for the 
expression of the Neo gene, the resistance gene was retained 
stably in the chromosome. 

Three different cell lines were used to test the efficiency 
of gene transfer of the new method. Mouse fibroblast cell 
line (L cells), mouse erythroleukemia cell line (C19TK). and 
COS cells. The COS cell line was used to establish condi- 
tions for transient gene expression. The eukaryotic expres- 
sion vector. CHI 10. contains bacterial fj-gal and was 
employed in these studies. The f$-gaJ gene in CH 1 10 is under 
the control of SV40 virus early promoter. 

The COS cells were treated with primer and men exposed 
to DNA-poIypeptide (2.5-5.0 pg) complex under hypotonic 
conditions. After this treatment, cells were returned to the 
normal growth condition. Transfected cells were grown at 
37° C. for 48 hrs, and stained for the expression of the £-gal 
reporter gene. Forty to fifty percent of the cells were positive 
for the expression of the reporter gene. 

Mouse L cells were transfected with eukaryotic expres- 
sion vector containing the Neo gene, which codes for the 
antibiotic G4 1 8 resistance gene. L cells arc sensitive to G4 1 8 
at 400 ug/ml Cells plated in 24- well tissue culture plates 
were then transfected with synthetic polypeptide complexed 
to the plasmid pRS V-Neo via the methodology of the present 
invention. 
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^^^A mouse crythroleukcmia cell line. C19TK. also w^^ed 
as a representative cell line for testing the transfection 
efficiency of the present invention with respect to hemato- 
poietic cells. The expression vector. pDR2. which carries a 
hygromyrin-resistance gene, was used for these studies. 
CI9TK cells are exquisitely sensitive for the antibiotic 
hygroraycin. This cell line grows in suspension and. hence, 
was transfected in suspension. 

Briefly, about million cells are spun down and the cell 
pellet is treated with primer. The cells are then exposed to 
DNA-polypcptide complex under hypotonic condition. 
Forty-eight hours after transfection. a known number of 
cells are plated in microti ter plates with hygromycin. The 
number of wells with growing population of ceils was 
enumerated to determifie the transfection efficiency. The 
stable transfection efficiency was about 1-5%, as compared 
to most of the other non- virus-based methods that are very 
poor. Thus, the method described herein is very efficient for 
stable transfection efficiency both for hematopoietic and non 
hematopoietic cell lines. Only some retrovirus based vectors 
give a transfection efficiency comparable to the efficiency 
obtained with the current method for hematopoietic cell 
lines: See GUboa. et al (1986). Miller, et aL (1986). 
StuMraann. et al. (1989). Miller, et al. (1989). and ZwiebeL 
et aL (1989). each cited above. 

EXAMPLE 2 

Transfer of Genes into Human Primary Cells 

The gene transfer method of the present invention was 
used to generate extended life cell lines from different 
human primary cells. Most of the primary cells have a 
limited in vitro life span. The following cell types were 
employed to test the efficacy of the inventive method to 
generate extended-life cell lines by transfer of various 
oncogenes, either singly, in pain of combinations, or com- 
binations of more than two oncogenes. Rhim. J. S.. et aL. 
Oncogene 4; 1403 (1989). 

EXAMPLE 3 
Production of Extended Life Cell Lines 
The method of introducing genes into primary cells is the 
same as that described above for introducing genes into 
established cell lines, such as the mouse fibroblast cell line 
L cells and the mouse erythroleukemia cell line C19TK. The 
main difference is that the host cell is a primary cell isolated 
from different species, human or other mammalian species, 
and the primary cells have only a limited in vitro life span. 
The isolation of primary cells from various tissue sources 
are well known to those of skill in the art 

In order to extend the life of primary cells that arc 
endogenously incapable of extended growth in vitro, the 
cells are transfected with different oncogenes, such as SV40 
large T antigen, polyoma large T antigen, adenovirus El A 
and E1B. v-fms. Bcl2. myc and ras. The oncogenes can be 
used either alone, in pairs of various combinations, or in 
combinations of more than two oncogenes. 

In addition, other genes that do not come under the 
category of oncogenes may be used. For example, genes that 
are important for DNA synthesis and normally active during 
the S phase of the cell cycle, such as the dihydrofolale 
reductase Rene (DHFRV thymidine kinase eenr. rhvmWvbfr. 
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synthetase gene, a DRTF1/E2F transcription factor encoding 
DNA. or DNA encoding the E2F transcription factor can be 
complexed to synthetic polypeptide and used to extend the 
life of primary cells. The human DHFR gene complexed to 
synthetic polypeptide can be introduced into primary cells to 
produce extended life cell lines. DNA encoding a transcrip- 
tion factor that is active during the S phase of the cell cycle 
are particularly useful in the method of the instant invention. 
La Tbangue. N. B. Trends in Biochemical Sciences 19: 108 
(1994); Johnson. D. G. et aL. Nature 365: 349 (1993). the 
respective contents of which are hereby incorporated by 
reference. 

Because untreated primary cells have only a limited life 
span in vitro, their ability to grow continuously in culture 
after treatment with the present invention served to select for 
extended life cell lines. No other drug selection markers 
need to be used to select for extended life cell lines derived 
from primary cells. 

To produce extended life cells lines from primary cells, 
newly cultured primary cells were treated by the method of 
the present invention employing synthetic polypeptide con- 
jugated with various oncogenes, such as SV40 large T 
antigen and/or Adeno E1A. The treated cells were plated in 
their appropriate growth media and passed after the cells 
reached confluency. A parallel scl of a control untreated 
primary cells were cultured under the same growth condi- 
tions. Typically, control primary cells stop growing after 
about 4—10 passages, depending upon the cell type (cell split 
ratio was usually 1:4 by surface area). In contrast, continu- 
ously growing cell lines were obtained from different pri- 
mary cell types described in the following examples. 

EXAMPLE 4 

Analysis of Transformed (Extended Life) Cells 

Extended life cell lines containing the oncogene are 
identified by restriction cleavage. Southern analysis and/or 
Northern analysis using appropriate DNA probes. 

The DNA of each transformed extended life cell line is 
analyzed by Southern hybridization to determine whether 
the cell lines carry the oncogenes used to establish such 
extended life cell liDes. DNA is extracted from the cell lines 
and the nucleic acid pellet is re -suspended in 200 ul of 10 
mM Tris-Cl pH 7.4. O.i mM EDTA, and 10 pg is digested 
with a specific restriction enzyme, clectrophoresed through 
1.0% agarose, and transferred to nitrocellulose. Southern. J. 
Mol. BicL 98: 503 (1975). Filters are hybridized to a 
radioactively labelled DNA. encoding each of the oncogenes 
that gave rise to the corresponding extended life cell line, in 
the presence of 10% dextran sulfate. After overnight 
hybridization, the fillers were washed twice in 2 X SSC. 
0.1% SDS at 64 c C 

Each transformed extended life cell line is analyzed by 
Northern hybridization to determine whether the cell lines 
transcribe the oncogenes. Cells not containing the oncogene 
of interest will not demonstrate transcripts in a Northern 
analysis whereas cells containing the DNA of interest will 
demonstrate a detectable transcript. Also, an ELISA method 
was used to detect the presence of oncogene products in 
some of the extended life cell lines, using publicly available 
antibodies that recognize the corresponding oncogene pro- 
tein. 
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The presence of SV40 large T antigen and adenovirus 
E1A gene products in the HUVEC extended life cell line, as 
delected by ELISA. are shown in table 2. Briefly, the cell 
line grown in a 96 well tissue culture plate is fixed with 
glutaraldehyde and paraformaldehyde. The cells are then 
treated with antibodies to the corresponding oncogenes. 
Thereafter, the cells are washed and then treated with a 
secondary antibody linked with to p-galactosidase. The cells 
arc washed and then treated with a substrate for 
p-galactosidasc. The reaction develops a product which is 
then measured using a raicroplate reader. 

To determine whether the extended life cell line has 
maintained the parental cell line phenorype may be deter- 
mined by a number of ways. - Extended life cells lines 
containing the oncogene are assessed by Northern analysis 
using a DNA probes encoding a cell-specific protein. The 
cell-specific DNA probe is labeled with >2 P-dCTP by nick 
translation pursuant, for example, to' Rigby el al„ J. MoL 
BioL 113: 237 (1977). Northern hybridization indicates that 
the extended life cell line is capable of transcribing the 
cell-specific protein. 

Also, the maintenance of the parental phenorype in cells 
lines established according to the present invention can be 
determined by a number of biochemical methods, such as 
ELISA and enzyme assays, that determine the presence or 
function of a protein specific to the parental cell line. An 
antibody recognizing a protein produced only bytbe parental 
cell line can be used in an ELBA of immunofluorescence 
assay. Cell-specific markers are well known to those of skill 
in the art For example, albumin is a marker for hepaiocytes. 
insulin is a marker for pancreatic beta islet cells, factor VIE 
is a marker for endothelial cells, actin and myosin are 
markers for smooth muscle cells, and non-specific esterass 
is a marker for brain microglial cells. In Table IL the parental 
phenorype of the extended life endothelial cells produced by 
the present method of the invention was verified by several 
Eos As to determine the expression of cell-specific endot- 
helial markers. The parental phenorype of the monocyte/ 
macrophage extended life cell lines produced by the present 
method was verified using a lysozyme enzyme assay to 
measure macrophage specific markers. 

EXAMPLE 5 

Human Umbilical Vein Endothelial Cells 

Endothelial cells isolated from the human umbilical vein 
can only be cultured for a limited of passages, usually five 
to six. These cells were transfected with a combination of 
oncogenes. SV40 large T antigen and adenovirus E1A. or 
with another combination of genes. At least two oncogenes 
are needed to develop a truly transformed cell line. Ruley. H. 
E.. et aL Nature 30*: 602 (1983). For the instant invention. 
SV40 large T antigen combined with v-rayc or ras or some 
an other oncogene can be used. When the gens encoding 
SV40 large T antigen is combined with either adenovirus 
E1A or E1B genes in the method of the instant invention, 
extended life cell lines may be produced from human 
umbilical vein endothelial cells. El A or E1B or SV40 large 
T antigen alone did not give rise to established ceil line with 
the high frequency obtained from using SV40 large T 
antigen in combination with E1A or E1B. Synthetic 
polypeptide complexed to DNA encoding either the SV40 
large T antigen or polyoma large T antigen combined with 
the E2F1 transcription factor gens also produces extended 
life HUVEC cells lines. 
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• • 

Since the non-transfected primary cells normally grow in 
vitro only for a limited population doublings, cells that have 
taken up the oncogenes capable of generating extended life 
span cell lines were selected simply by repeated passage of 
the cells. When the transfected population of cells grows 
continuously, as compared to a control population of paren- 
tal cells, it is reasonable to conclude that the oncogenes used 
arc capable of generating extended life cells from a given 
ceil type. 

In HUVEC for example, SV40 large T antigen and 
adenovirus ElA or E1B were effective in giving rise to a cell 
line. This cell line has now been growing in culture for 40 
passages. In contrast normal HUVECs stop growing by 
passage 7 or 8. Such cell lines arose with a high efficiency. 
It also is possible to generate cell lines using as few as a 
couple of hundred cells, grown either in a 24- or 48-well 
plate. These cells also have the same morphological appear- 
ance as the primary HUVEC and also display many of the 
biochemical properties characteristic of normal HUVEC. 

Some of the properties thai are characteristic of endothe- 
lial cells that were measured in the HUVEC extended life 
cell line are also listed in Table 2. These properties were also 
measured by F.I ISA using specific antibodies listed in the 
Table 2. 

TABLE m 

ELISA uuy for the expression of ELAM-1. VCAM-1, 
1CAM-1, SV40 large T antigen urf 
adenovirus ElA by ejicockd life HUVEC line 

QP m 



Antibody -JL-1 +IL-1 



Control 


0.071 


o.o69 


Ann ELAM-t 


0.212 


1.016 


Anti VCAM-l 


0.146 


0.520 


AntilCAM-l 


0.422 


1.524 


Anti SV40 large T 


0.618 




And ElA 


0.725 





EXAMPLE 6 
Human Cord Blood-Derived Monocyte Cell Line 

Adherent cells from human cord blood cells were trans- 
fected with different combinations of oncogenes in suspen- 
sion using the method of the present invention. The resulting 
cells are selected in Granulocyte-Macrophage Colony 
Stimulating Factor (G-CSF). Control cells did not grow in 
culture, whereas growing populations of monocytes were 
obtained with several combinations of oncogenes. One 
preferred combination of polyoma large T antigen and 
adenovirus E1B encoding DNA produced extended life 

monocyte cells lines with somewhat higher efficiency than 
other combinations. Another preferred combination of SV40 
large T or polyoma large T antigen and the E2F1 transcrip- 
tion factor gene produces monocyte extended life cells with 
high efficiency. The monocyte extended life cells also dis- 
play many of the properties of normal monocytes, which 
illustrates the utility of the present invention in generating 
cell lines of hematopoietic origin. 
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EXAMPLE 7 



Extended Life Human Aortic Smooth Muscle Cells 

The method of the instant invention has also been used to 
generate extended life cell lines using a specific combination 
of oncogenes. Human aortic smooth muscle cells were 
obtained from Clonetics Corporation (San Diego. Calif. 
U.S.A.) and trans fectcd with several combination of onco- 
genes. The combination of polyoma large T antigen and EiB 
gave rise to a continuously Rowing population of smooth 
muscle cells. Another preferred combination of SV40 Large 
T or polyoma large T antigen and the E2F1 transcription 
factor gene produces extended life human aortic smooth 
muscle cells with high efficiency. This cell line resembles the 
early passage primary aortic smooth muscle cells morpho- 
logically. The extended life human aortic smooth muscle 
cells also express smooth cell actin and myosin well beyond 
passage 20. 

EXAMPLE 8 
Other Extended Life Cell Types 

Primary cells from other species, such as rabbit and 
monkey, also have been used to generate cell lines. Trans- 
fection methods employed for primary cells from non- 
human species are similar to those used for human primary 
cells. When developing an extended life cell line from a new 
primary cell, several different combinations of available 
oncogenes should be tried. For example, at least five or six 
pairs of combinations of SV40 large T antigen, adenovirus 
E1A, adenovirus EIB. polyoma virus large T antigen or 
others available to those in the art. That combination of 
genes that gives rise to an extended life cell lines from a 
given primary cell type is determined as described in the 
above examples. 

When the E2F1 transcription factor gens is cornplexed to 
synthetic polypeptide in combination with DNA encoding 
either the S V40 large T antigen or polyoma large T antigen, 
extended life cells lines can be produced from a variety of 
primary cell types, such as HUVEC. dermal microvascular 
endothelial cells, human aortic smooth muscle cells, and 
bone marrow nx>nocyte/niaaophage cells. Thus, the method 
of the present invention can identify a combination of 
oncogene DNAs that is highly efficient in producing 
extended life cells lines from the primary cells of various 
species. The present invention also comprehends a combi- 
nation of an oncogene and an S-phase transcription factor 
gens which likewise is highly efficient in producing 
extended life cells lines from different types of primary cells. 

EXAMPLE 9 

Identification of Cell Type-Specific Transcriptional 
and Translational Regulatory Sequences 

The present invention provides a screening system for 
identifying sequences that influence the expression of cloned 
genes in various primary cell types from different species. 
The instant invention can identify cell type specific tran- 
scription and translational regulatory sequences. The 
sequence in question typically will be cloned into a vector 
containing a reporter gens, such as chloramphenicol acetyl 
transferass or lucif crass, and then transfected into various 
cell types using the method described herein. Expression of 
the reporter gens determines the tissue specificity of the 
regulatory sequence. 

• is s .* . 



SEQUENCE LISTINO 



< I > GENERAL INFORMATION: 

( i i i \ NUMB EH Of SBQUENCES: 56 

( 2 ) INFORMATION POR SEQ ID NO: I: 
( i ) 



( 2 ) INFORMATION FOR KEQ ID NOJ: 



( : ) INPORMATION FOR SEQ ID 




( 2 ) INFORMATION FOR SEQ ID NO:4: 



I t i ) SEQUENCE DESCRIPTION: SEQ ID NO:*: 



Ala Alt Pbc Olo A l 

I 5 



Leo Arg V«I A i s Set 
I 0 



( 2 ) INFORMATION FOR SEQ ID NO 3: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: * amnD acids 
( B ) TYPE: aomao acid 
( C ) STRANDEDNESS: lagfe 
< D > TOPOLOGY: line* 

( i i > SEQUENCE DESCRIPTION: SEQ ID NO- 

Pro Arg L j i Arg 
I 



16 



0 



< 2 ) INFORMATION FOR^B^ID NO^: 

( i ) SEQUENCE CHARACTERISTICS: 
1 A ) LENGTH: 9 *mux> add* 
fiB ) TYPE: modo acid 
(IC ) STRANDEDNESS: ua«fe 
{ ID ) TOPOLOGY: t*c« 



(ii )5E 



2 DESCRIPTION: SBQ ID NOt6: 



V • 1 S t rl A t | Lfi Art P t a Art Pro A 1 * 

3 



( 2 ) INFORMATION FOR SBQ ID NO-.7: 

( i ) SEO^EN^CHAAACTEWSTICS: 
( A ) UENOTH: 6 van acids 
( B ) TyfPE: atxcao acid 
( C ) STRANDEDNESS: ncgW 
( D ) IX^OLOOY: linear 

( I i j SEQUENCE DESCRIPTION: SBQ ID NO:7: 



< 2 ) INFORMATION FOR SBQ ID] 




( 2 ) INFORMATION FOR SBQ ID NO* 



I i ) SBQUENCE CHA^ACTE^jSTICS: 
( A » LENGTH: 7 acdao acid* 
( B ) TYPE: acmso arid 
( C )STRAND£DNESS: n=*fe 
{ O ) TOPOLOGY: tuLar 

( i i > SEQUENCE DESCWPTIOnJ SBQ ID N09: 

Pio Met Lji' Lp Lp A r g Lp 
I 1 5 



< 2 > INFORMATION FOR SBQ ID NO: 10: 

( i ) SEQUENCE CHAJIACTEWSTICS: 
( A ) LENGTH: 17 KBM mot 
< B ) TYPE: msmno acid 1 
( C ) STRANDEDNESS: lr*j 
( D ) TOPOLOGY: tiat*: 



( x i > SEQUENCE DESCRIPTION: SEp ID NCMO: 
A I 



A r | Pro At 
I 

A • p 



( 2 > INFORMATION FOR SBQ ID NO:!!: 



br Ljrt Ly i Ala Ol y Ola A 

I 0 



( i ) SEQUENCE CHARACTERISTIC^: 

< A ) LENGTH: 3 
( B > TYPE: 
I C > STRANDEDNESS: 

< D ) TOPOLOGY: ] 



CTERJSTKK: 
eDNESS:«A«fe 



( x i ) SEQUENCE DESCRIPTION: SBQ ID NO: 1 1 : 

V; 

5 \ 
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f 2 ) INPORWADON FOR $J 

( i ) SEQUENCE CHARACTERISTICS: 
( At) LENGTH: 19 dm 

< all) TYPE, maao m! 

< CW STRANDEDNESS: • 
( dBtOPOLOOY: L-t* 




j description: sbq a> Nan: 



Af| V.l YTbr M« At| Tbi V.l Ar ( V»| Art Ar« Pro Pro Lyt Oly Lyi 



Hit Ai| 



( 2 > INFORMATION FOR S! Q tD NOD: 

( i ) SEQUENCE HARACTERJSTICS: 
( A ) LENGTH: 7 
( B » r4pE: mom acid 
i C > 

( D >TOfOLOOY: 



I 0 



1 5 



7 • 



( t i )SBQUENCE 
Oly L y i L jr 




ON: SBQ ID NO-.U: 



» Arg Scr Ly i Ala 

5 



< 2 ) INFORMATION FOR SEQ ID NO-.14: 



< i > SEQUENCE < HARACTERJSTiCS: 
( A ) LI NOTH: 7 cnoo adds 



( i i ) SEQUENCE 



I B 1 Ti PE: msimo acid 

( C ) STRANDEDNESS: mgk 

( D ) TC POLOOY: &M* 



ON: SEQ ID NO: 1 4: 



( 2 ) INFORMATION FOR SEQ IjD NO: 15: 




( x i ) SEQUENCE 

A » p Arg Lt 
1 



( 2 ) INFORMATION FOR SBQ ID 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 6 axoao acida 
( B ) TYPE: rnxsoo acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY \tacM 

( i i ) SEQUENCE DESCRIPT ON: SBQ ED NO 16: 



\ • Ai | L y i 

S 



Pro Ly i Ola L 
t 



( 2 ) INFORMATION FOR SBQ ID NOT 



( i ) SEQUENCE CHARACTERISTICS: 
( A )UNOTH:7i 

< B >TYPE: 

( C ) STRANDEDNESS: tagfe 

< D ) TOPOLOOY: 1 

( x i ) SEQUENCE D£5CRJPTlON| SEQ CD NO. 1 7: 
Vi | Arg Ly f L y t A r | Ly* Tbr 



1 
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( 2 ) INFORMATION FOR SBQ ID NO. IB: 

f i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 7 maao ndi 
( B ) TYPE: amuo k*1 
( <J ) STRANDEDNESS: n«k 



( 2 ) INFORMATION FOR SEQ ID NO-.l* 




( 2 t INFORMATION FOR SBQ ID NO-JO: 

i i ) SEQUENCE CTlAJlUcTERJSTlCS: 
( A > LENGTH: 10 crinn iddt 
( B ) TYPE: ^mao «nd 
( C )S 

( D > TOPOLOGY: 
{ i i > SEQUENCE DESCRpTION: SEQ 05 Na30: 
Atg Ola Arg 



Ai| Asa 01 « Leu L j i Ser Phe 

5 1 O 



( 2 ) INFORMATION FOR SBQ ID NqSl 

( i ) SEQUENCE CHARA< 
( A ) LENGTH: 
( B ) TYPE: 
( C ) 
( D) 

< x i ) SEQUENCE 
Tbr Lyi L r • A i 




txugfc 



SBQ ID NO-.21: 



Lf i Leu G I o 
5 



( 2 ) INFORMATION FOR SEQ ID NOt22: 

t i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 7 a 
{ B ) TVFE: jcuo Kkl 
< C ) STRANDEDNESit <a«k 
( D ) TOPOLOOY: 6atA 

Hi) SEQUENCE DESCRIPTION: SBQ ID N022: 



Pro Ly i Tbf Ar 
t 



* V 



ry Ai | Pro 



( 2 > INFORMATION FOR SBQ ID N023: 



\ 



( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 7 tnzm — 
( B ) TYPE: maao *cid 
( C ) STRANDEDNESS: tia, 
i D) TOPOLOGY: hmtm 

{ I i ) SEQUENCE DESOUFTION: SBQ nShJOJS: 

Sir O I • A r » L r i Ar || Pro Pro 
I 5 



19 



2 ) IhTPORMATION FOR S£Q 



i i > 



( 2 ) INFORMATION FOR 

( i ) 




tDNOO* 



i CHARACTERISTICS: 
[ A ) LENGTH: 12 msm andt 
; B ) TYPE: aoouo acid 
(, C ) STRANDEDNESS: «mgta 
(\6 ) TOPOLOOY: laws 

DESCRIPTION: SEQ ID N034: 



Ar | L • o\ P r o V a 1 At | Ar« Ar| Ar| At | Art V * I Pro 
I 



ID N025: 



( x i > SEQUENCE 



CHARACTERISTICS: 
( A ) l£NOTH; 5 anaoo _a=kb 

( B > ' YPE: mxboo acid 

t C > ITRANDEDNESS: >«filc 

( D ) TOPOLOOY: tnrai 



RJPTION; SEQ ID NO:25: 




f 2 > INFORMATION FOR SEQ ID NOS6: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 i 
( B ) TYPE: kmoo acid 
{ C > STRANDEDNESS: tingle 
f D t TOfOLcVoY: bat* 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NC06: 

V»| Ar| Tbt Tibr Lyi G I y Lyi Arg Lyi A r g tie Aip V a 
I \ 5 10 



( 2 > INFORMATION FOR SEQ ID NOi 

{ i ) SEQUENCE CHARACTERISTICS: 
< A > LENGTH: 5 i 
( B )TYPB: amino Vid 
{ C > STRAhfDEDNtfSS: tm&c 
( D > TOPOLOOY: I 

( x i ) SEQUENCE DESOUPTIC+A SEQ CD NO. 27: 





( 2 ) INFORMATION FOR SEQ ID NO:3: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: S amino ~ J - 
( B ) TYPE: asiao acid 
( C ) STRANDEDNESS: 
( D ) TOPOLOOY: (bear 

(Mi) SEQUENCE DESCRIPTION: ! 

Arg At | At* A f g Af ' 

1 5 



AID NO--28: 



Arg Arg I f p 



( 2 ) INFORMATION FOR S8Q ID NO£9: 



< ■ ) SEQUENCE CHARACTERISTICS: \ 
< A ) LENGTH: 15 amiao 
( 8 ) TYPE: amiao acid 
( C > STRANDEDNESS: a 
( D ) TOPOLOOY: Eaaar 



( x i ) SEQUENCE DESCRIPTION: SBQ ID NO-Jfc 

Pre Arg Scr O I y L y t LW i Arg Lyi Arg L y • Arg Leo Ly» Pro Tbr 
1 5 \ 10 IS 



20 



( 2 ) INPORMXDON FOR SEQ ID NOOCfc 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 5 m~ : 

( B ) TYPE: ao£*o acid 
( <T ) STRANDEDNESS: vn&c 
( q )TOPOLOOY:I 

< * i ) 



| 2 ) INFORMATION FOR ID NOJ I: 



f 2 ) INFORMATION FOR 



( 2 INFORMATION FOR SBQ irANO-JO: 




( 2 ) INFORMATION FOR SBQ ID 



L y • It* 11* Ola 
3 



< 2 ) INFORMATION FOR SBQ ID NO-J3. J 



© NO-33: 
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( 2 ) INFORMATION k>R SJBQ CD NO-J6: 

( i > SEQUENCE CHARACTESUSTKS: 
( A ) UENCTTH: 7 mud aatfe 
t B ) TYPE: Kiiao «d 
( C ) STRANDEDNESS: «m«Jo 
( D\) TOPOLOGY: fiawr 

(si) SEQUENCE DESCRIPTION: SBC* ID NC06: 



Pro Qlt 
I 



t : ) INFORMATION FOR SEQ ID NOJ7: 



Pro Lyi L» i Lyi Pro 

5 




P fa e L 7 i 
I 

Arf At | 



< D ) TC POCOOY: Hnur 
( i i > SEQUENCE DESCRIPTION. SBQ ID NOJ7: 



Lyi Hit Lyi Lyi A i p lie Ser Olo Aid Lyi Arg All V » | 

5 10 M 



( 2 > INFORMATION FOR SEQ lraNO-38: 

I i ) SEQUENCE CHAJ^CTERJSTICS: 
\ A ) L£NOTH: 9 ttuo adds 
( B ) TYPE: ambo acid 
( C ) STRANTJEDNESS: ria$k 
( D ) TOPOLOGY: tec* 

Mi) SEQUENCE DESCRIPTION: SEQ CD NOJ8: 

Ser Lp C y i lieu OI y Trp Leu Trp Gt 7 
I \ 5 



( 2 ) INFORMATION FOR SEQ ID NOO?: 

( i ) SEQUENCE CHARAC 

< A ) LENGTH: 

< B ) TYPE: i 
( C ) STRA.V 
( D ) 70POCOOY: 1 

(»i ') SEQUENCE DESCR1PT10N| SEQ ID NO J* 

G 1 y Lyi Aig LyilAu Lyi Pro Lyt 
1 l* 



( 2 ) INFORMATION FOR SEQ ID NO-«>: 

( i ) SEQUENCE CHARACTERS 
( A » LENGTH: 7 
( B > TYPE: annuo acid 
( C ) STRANDEDNESS: 
< D ) TOPOLOGY: hacM 

( * i > SEQUENCE DESCRIPTION: SEQ it NO:«0: 

Lyi Tbr Arg Lji H i f \ A r g O I y 
I 3 



\ 
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( 2 ) INFORMATION FOR SBQ ID NO:4I: 



( 2 ) INFORMATION FORI SBQ ID NO:«2; 



Arg Ar| lAq A I * G I 0 A > a At | Olu Ala At* A l g L r » Ser A f g Leu A r j 
5 10 15 




( 2 > INFORMATION FOR S5Q ID) NO O 



L y i L t > A r g lA 1 « At | Leu Vi | Arg All Ar | Olu Scr Ala O 1 o Lcd Str 
1 I 5 10 15 



( 2 ) INFORMATION FOR SBQ ID NOl44 



( 2 ) INFORMATION FOR SBQ ID NO. 45 

( i ) SEQUENCE CHARACTEldsnCS: 
( A > LENGTH: 18 a^a 
< B ) TYPE: 

( C ) STRANDEDNES^: tingfc 



< D ) TOPOLOGY: I 
(li ) SEQUENCE DESCRIPTION: SBQ ID NO<5: 



Ar | L j i Arg L j 
I 



I I o Scr Aid Arg Olu Set Ala Arg Atg Scr Arg T?r 
y\ 10 15 



Arg L jf • 
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( 2 ) rNFDRMATtC^ PC* SEQ H> NCM6: 



CHARACTERISTICS: 
( A ) LENGTH: U i 
( B >TYFE: 

;( C ) STRANDEDMESS: 
D ) TOPOLOOY: Eat« 

DESCRIPTION: SBQ ID NO«6: 



DESCRIPTION: SEQ ID NO.*T: 

Ly > A r | IP i o Aip Glu Aip A > p Aip O I j Glu Pio Set Glo Aig Lyi A r ] 




o L71 Hit L71 L • o Lji M • t L 7 • r « Ly 1 



Pro Glo Atp Arg Tyr lie Oil Tbr Ola Ly 1 Ty r Oly Arg 
5 10 I $ 



j t O I y Ser Atp Tbr O I u tic L y 1 Leo Lea Set Ly 1 

5 10 15 



Pro Ali Alt V • 1 Scr Ser Ala Vil 

10 15 



Val O I 7 A » p Pro 

: 0 



24 



( 2 ) INFORMATION 
( i ) 



< A ) LLNGTH: 
AB ) TYPE: kim acal 
( V ) STRANDEDNESS: n«k 
( O ) TOPOLOGY: li»c« 

\ 

( t i ) SEQUENCE DESCRIPTION: SSQ ID NOSI 



A » a Ala 11* Leo Ar| Art Ljr « Ln 

5 1 0 



( 2 ) INFORMATION FOR 




O I i Ola Atp Lea Ola 
I 5 



{ t i ) SEQUENCE DESCRIPTION: SBQ ID NOS2: 



Oly A 
1 



< : > INFORMATION FOR SBQ IDiNOJJ: 



•p Ari At | At* All Pro At* Arg P r < 

\ 3 10 

KM 

( i ) SEQUENCE CHAR^CTEJUSTICS: 
{ A ) LENOTH: 10 kxuo acxfc 
{ B > TYPE: ecuo acid 
( C ) STRAND£DNESS: nagje 
( D ) TOPOLOOY: li« 

« i > SEQUENCE DESCRIPTION. SBQ ID NOi53: 

Met S e r Gin Ar | Ly i Arg Arg Olu L y » Lea 
5 I 0 



( 2 > INFORMATION FOR SBQ ID NO-34: 



I i ) SEQUENCE CHARACTERISTICS: 

< A > LENOTH: 15 «nbo *ad* 

< B ) TYPE: amoo acid 

( C > STRANDEDNES5: imak 
( D ) TOPOUX)Y|tM 

(si) SEQUENCE DESCRIPTION: SBQ ID t*>M: 



Mc c Me Scr O 1 
I 



( 2 > INFORMATION FOR SBQ ID NOJ5 



o At* Lea Arg L y * Ala lie Ol ; L j i Arg 

5 1 0 



( i ) SEQUENCE CHARACTERISTICS: 
( A > LENOTH: 16 mn*o *d<b 

< B ) TYPE: azuo acid 
I C > STRANDEDNESS: tn*te 
( D ) TOPOLOGY: UaeU 

( x i ) SEQUENCE DESCRIPTION: (SBQ CD NO:55: 

Ly i Arg Pro A I i A 1 i Tbr L 7 • L j » Ala G I 7 Ola A t 1 L 7 » L r • Ly 1 L y 

I ||$ 10 15 

{ 2 ) INFORMATION POR SBQ ID NCh56: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 25 «=i*p an* 
( B ) TYPE: aonjao arid 

( C > STRANDEDNESS: trrgW 
( D )TOPOLOOY: Lac« 

( x i ) S8QUENCE DESCRIPTION: SBQ CD NOJ6: 

Pro Ly» L y » Lyi i\rg Lfi V • I Ser Oly Oly Oly Oly Gly Lyi L y a Ly 

I 5] 10 15 

Ly» Lyt Lyt Lyi L y » Lr 1 Ly» Lyi Lyi I 

10 \ » J 
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